Assessing the Effects of Communication Faults on Parallel Applications1
نویسندگان
چکیده
This paper addresses the problem of injection of faults in the communication system of disjoint memory parallel computers and presents fault injection results showing that 5% to 30% of the faults injected in the communication subsystem of a commercial parallel computer caused undetected errors that lead the application to generate erroneous results. All these cases correspond to situations in which it would be virtually impossible to detect that the benchmark output was erroneous, as the size of the results file was plausible and no system errors had been detected. This emphasises the need for fault tolerant techniques in parallel systems in order to achieve confidence in the application results. This is especially true in massively parallel computers, as the probability of occurring faults increase with the number of processing nodes. Moreover, in disjoint memory computers, which is the most popular and scalable parallel architecture, the communication subsystem plays an important role, and is also very prone to errors. CSFI (Communication Software Fault Injector) is a versatile tool to inject communication faults in parallel computers. Faults injected with CSFI directly emulate communication faults and spurious messages generated by non fail-silent nodes by software, allowing the evaluation of the impact of faults in parallel systems, and the assessment of fault tolerant techniques. The use of CSFI is nearly transparent to the target application as it only requires minor adaptations. Deterministic faults of different nature can be injected without user intervention and fault injection results are collected automatically by CSFI.
منابع مشابه
Comparison of the Eccentricity Faults Effects on the Performance of several Toroidal Wounded Axial Flux Permanent Magnet Motors
Eccentricity fault is one the most common fault types of disk-type permanent magnet machines, which could lead to devastating effects. Unfortunately, most of the previous works have studied this fault and its detection techniques for slotted structure with common winding. Therefore, in this paper, the effects of eccentricity faults on the performance of single-sided slotted, single-sided slotle...
متن کاملThe scrutiny of geomorphologic effects of Armaghankhane and Taham faults
One of the unique properties of northern landforms of zanjanrood catchment is having smooth surfaces that have been interrupted by deep valleys. Rivers that don’t have a wide catchment upper their front mount are running in parallel deep valleys that the topographical situations don’t let them to receive around surface runoffs. This situation has made them to move in parallel form and not to jo...
متن کاملA new strategy for controlling wind turbines against sensor faults and wake effects to harvest more electrical energy
This paper describes a new method for harvesting maximum electrical energy in wind farms. In proposing technique, the stochastic process principles are applied for detecting fault measurements of sensors. On the other hand, the wind farm is modeled by using fuzzy concept. Thereby the turbines are controlled against continuous changes in speed, direction and eddy currents of the blowing wind. To...
متن کاملOn Feasibility of Adaptive Level Hardware Evolution for Emergent Fault Tolerant Communication
A permanent physical fault in communication lines usually leads to a failure. The feasibility of evolution of a self organized communication is studied in this paper to defeat this problem. In this case a communication protocol may emerge between blocks and also can adapt itself to environmental changes like physical faults and defects. In spite of faults, blocks may continue to function since ...
متن کاملCommunication and Affective Variables Influencing Omani EFL Learners’ Willingness to Communicate
This study examined Omani EFL learners’ perceptions toward their willingness to communicate (WTC) in English. To this end, 204 students majoring in English language at a private university in Oman were assigned a questionnaire adapted from McCroskey’s (1992) WTC scale to determine possible effects of communication and affective variables on their WTC in English. After assessing the normality di...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995